Distance Functions for Reproducing Kernel Hilbert Spaces 
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Abstract. Suppose H is a space of functions on X. If H is a Hilbert space 
with reproducing kernel then that structure of H can be used to build distance 
functions on X. We describe some of those and their interpretations and 
interrelations. We also present some computational properties and examples. 



1. Introduction and Summary 

If H is a Hilbert space with reproducing kernel then there is an associated set, 
X, and the elements of H are realized as functions on X. The space H can then 
be used to define distance functions on X. We will present several of these and 
discuss their interpretations, interrelations and properties. We find it particularly 
interesting that these ideas interface with so many other areas of mathematics. 

Some of our computations and comments are new but many of the details 
presented here are known, although perhaps not as well known as they might be. 
One of our goals in this note is to bring these details together and place them in 
unified larger picture. The choices of specific topics however reflects the recent 
interests of the authors and some relevant topics get little or no mention. 

The model cases for what we discuss are the hyperbolic and pseudohyperbolic 
distance functions on the unit disk D. We recall that material in the next section. In 
the section after that we introduce definitions, notation, and some basic properties 
of Hilbert spaces with reproducing kernels. In Section 4 we introduce a function 
<5, show that it is a distance function on X, and provide interpretations of it. In 
the section after that we introduce a pair of distance functions first considered 
in this context by Kobayshi and which, although not the same as 5, are closely 
related. In Section 6 we discuss the relation between the distance functions that 
have been introduced and distances coming from having a Riemannian metric on 
X. The model case for this is the relation between three quantities on the disk, the 
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pseudohyperbolic distance, its infinitesimal version, the Poincare-Bergman metric 
tensor, and the associated geodesic distance, the hyperbolic metric. 

Several of the themes here are common in recent literature on reproducing 
kernel Hilbert spaces. Some of the results here appear in the literature as re- 
sults for Bergman spaces, but in hindsight they extend almost without change to 
larger classes of Hilbert space with reproducing kernel. Also, many results for 
the Hardy space suggest natural and productive questions for reproducing kernel 
Hilbert spaces with complete Nevanlinna Pick kernels. Those spaces have substan- 
tial additional structure and S then has additional interpretations and properties. 
That is discussed in Section 7. 

Section 8 focuses on computations of how S changes when H is replaced by 
a subspace. Although that work is phrased in the language of distance functions 
it can be seen as yet another instance of using extremal functions for invariant or 
co-invariant subspaces as tools to study the subspaces. 

Finally, we would like to emphasize that the material we present has not been 
studied much and most of the natural questions one can ask in this area are open. 

2. Distances on the Unit Disk 

Here we collect some background material; references are [G], [MPS], and 

m 

The pseudohyperbolic metric, p, is a metric on the unit disk, D, defined by, for 

z,w e D, 

' z — w 



p(z,w) = 



1 — zw 

Given any distance function a we can define the length of a curve 7 : [a, b] — > D by 

C(t) = SU P \ X! cr (7(*i))7(*i+i)) ■ a = t < h < ... < t n = b> . 

Using this functional we can define a new distance, a*, by 

a* (z, w) = inf {-£(7(7) : 7 a curve joining z to w} . 

Automatically a* > a and if equality holds a is called an inner distance. More 
generally a* is referred to as the inner distance generated by a. 

The distance p is not an inner distance. The associated p* is the hyperbolic 
distance, /3, which is related to p by 

/3 = log — — , p = - tanh/3. 
1 — p 2 

The hyperbolic distance can also be obtained by regarding the disk as a Riemannian 
manifold with length element 

2\dz\ 



ds 



l-\zf 



in which case /3(z,w) is the length of the geodesic connecting z to w. 

The Hardy space, H 2 = H 2 (B) , is the Hilbert space of functions, f(z) — 
S^Lo a nZ n which are holomorphic on the disk and for which ||/|| = ^2 l a «l < °°- 
The inner product of / with g(z) — ^6„z™ is (/, g) — J2 a nb n - The Hardy space 
is a Hilbert space with reproducing kernel. That is, for each £ € D there is a 
kernel function £ H 2 which reproduces the value of functions at (; V/ € H 2 . 
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{f,k(} — /(C)- It is straightforward to see that there is at most one such function 

and that k^ (z) = (l — Qz) 1 has the required property. 

For the Hardy space now, and later for a general reproducing kernel Hilbert 
space, we are interested in the functional 8 (•, •) , defined for (z,w) e D x I, by 



(2.1) S(z,w)=S H ,(z,w) = Jl- 

For the Hardy space this is evaluated as 



k v 



\kz\\ \\kw\\ 



(2.2) 6 m (z,w) = Jl 



(1-|*I)(1-M 2 ) 



|1 — zw\ 2 



1 — zw 



This can be simplified using a wonderful identity. For z,w € 

(2.3) ^(i-l^q-H 2 ) 

|1 — zw\ 
Hence 6^2 (z, w) — p{z, w). 

3. Reproducing Kernel Hilbert Spaces 

By a reproducing kernel Hilbert space, RKHS, we mean a Hilbert space H of 
functions defined on a set X together with a function K(-, •) defined on X x X with 
two properties; first, Vx € X, k x {-) = K(-,x) e H, second V/ G H (f,k x ) = f{x). 
The function k x is called the reproducing kernel for the point x. We will use the 
following notation for unit vectors in the direction of the kernel functions. For 
x G X we set 

i k x 

w 

General background on such spaces can be found, for instance, in |AMj . Here 
we will just mention three families of examples and collect some standard facts. 

3.1. Examples. 

The Dirichlet-Hardy-Bergman Family 

For a > let H a be the RKHS of holomorphic functions on D with the repro- 
ducing kernel 

K a (w, z) = k a , z (w) = (1 - zw) a . 
For a — there is the limit version 

K (w,z) = k , z (w) = -j- log 1 _ . 

zw 1 — zw 

We have not included normalizing multiplicative constants as part of the kernels; 
we will be dealing only with expressions similar to (12.11) which are not affected by 
such constants. Also, we have not specified the norms for the spaces. In fact we 
will need to know how to take inner products with kernel functions but we will 
never need exact formulas for norms of general functions in the spaces. Hence we 
will give Hilbert space norms for the function spaces which are equivalent to the 
intrinsic RKHS norms. 

First we consider the case a > 1. These are generalized Bergman spaces; / € T~L a 
if and only if 

ii/n« a ~ / / i/wi 2 (i-ki 2 r~ 2 ^'<oo. 
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The case a = 2 is the classical Bergman space. 

If a = 1 we have the Hardy space described earlier. In that case the norm can 
be given using the radial boundary values f*{e l9 ) by 

\\f\\k = / \r(e w )\ 2 de. 

An equivalent norm for the Hardy space is 

ll/ll^ H/(0)| 2 + / / \f{z)\ 2 {l-\z\ 2 )dxdw. 
J Jb 

The second description of the norm for the Hardy space is the one which generalizes 
to a < 1. For < a < 1, / is in W a exactly if 

\\ff Ha H/(0)| 2 + / / \f'(z)f(l-\z\ 2 ) a dxdw<^. 
J Jo 

The space Hq is the Dirichlet space and the H a for < a < 1 are called generalized 
Dirichlet spaces. 

The Fock-Segal-Bargmann Scale 

For (3 > let Fp be the Hilbert space of holomorphic functions on C for which 

\\f\\% p ~ J J c \f(z)\ 2 e-^ 2 dxdy<^. 
This is a RKHS and the kernel function is given by 

Remark 3.1. There are other families of RKHS for which the kernel functions are 
powers of each other and still others where such relations hold asymptotically, see, 
for instance [E] , [JPRj . 

General Bergman Spaces 

Suppose f2 is a bounded domain in C or, for that matter, C™. The Bergman 
space of f2, B(fl), is the space of all functions holomorphic on f2 which are square 
integrable with respect to volume measure; / e £?(£!) exactly if 

Wf\\ 2 B { n)= J Jjf(z)\ 2 dV(z)<^. 

In this case it is easy to see that B(£l) is a Hilbert space and that evaluation at 
points of f2 are continuous functionals and hence are given by inner products with 
some kernel functions. However, and this is one of the reasons for mentioning 
this example, it is generically not possible to write explicit formulas for the kernel 
functions. 

3.2. Multipliers. Associated with a RKHS H is the space M(H) of multi- 
pliers of H, functions m defined on X with the property that multiplication by m 
is a bounded map of H into itself. For m e M (H) we will denote the operator of 
multiplication by m by M m . The multiplier norm of m is defined to be the operator 
norm of M m . 

For example, the multiplier algebra M(B(Q)) consists of all bounded analytic 
functions on fi. The multiplier algebra of any of the spaces Fp consists of only the 
constant functions. 
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3.3. Background Facts. Suppose that H is a RKHS on X with kernel func- 
tions {k x } x£X an d multiplier algebra M(H). The following are elementary Hilbcrt 
space results. 

Proposition 1. Suppose / € H, \\f\\ < l,x € H. The maximum possible value 
of Re/(z) (and hence also of |/(z)|) is the value ||fc 2 || = fcj(z) 1 / 2 attained by the 
unique function / = k z . 

Proposition 2. There is a unique F z>w € H with H < 1 and F Z:W (z) = 

which maximizes Re F z ^ w (w). It is given by 



F z , w (•) = 

and it has 



kw ( ■ ) \\k z \\ 


k w (z) k z (■) 






2 


\\k w \\ sjl- 


{^kzi k w \ 








2 


= \\k w \\\Jl- 


k z , k w \ 





Proposition 3. For m G M(H),x e X we have (M m )* k x — m (x)k x 

Proposition 4. Suppose to € M(H),x,y € X. If ||-^m|lM(H) — ^ an< ^ K 1 ) = 
then 



k x , 



4. The Sine of the Angle 

Suppose we have a RKHS H of functions on X and we want to introduce 
a metric on X that reflects the properties of functions in H. There are various 
ways to do this, for instance we could declare the distance between x, y G X to 
be 

. Here we focus on a different choice. Motivated by, among other 
things, the modulus of continuity estimates in Proposition [5] and Proposition 2] we 
define, if neither k x nor k y is the zero function, 



(4.1) 



S(x,y) = 5 H (x,y) = \ l - 



k x , k t 



We don't have a satisfactory definition of 8{x, y) if k x or k y is the zero function. 
Either declaring these distances to be 1 or to be would lead to awkwardness later. 
Instead we leave 6 undefined in such cases. However we will overlook that fact and, 
for instance write 

Vsc, y £ X, 6 Hl (x, y) = S H2 (x, y) 
to actually mean that the stated equality holds for all x,y for which both sides are 
defined. 

One way to interpret i5 is to note that, by virtue of the propositions in the 
previous section, 

sup {\f(y)\ :f£H, ||/|| = l,f(x)=0} 



5h{x,v) = 



sup{|/(y)| :feH, ll/H = 1} 



Also, 5(x,y) measures how close the unit vectors k x and k y are to being parallel. 
If 8 is the angle between the two then 5(x,y) — \/T— cos 2 9 = \smO\ . 
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In fact 8 is a pseudo-metric. It is clearly symmetric. It is positive semidcfinitc 
and will be positive definite if H separates points of X. (Although we will consider 
spaces which do not separate all pairs of points we will still refer to 6 as a metric.) 
The triangle inequality can be verified by a simple argument [AMI Pg. 128]. 
Instead we proceed to computations which develop further the idea that S measures 
the distance between points in the context of H. A corollary of the first of those 
computations is that 8 satisfies the triangle inequality. 

For a linear operator L we denote the operator norm by ||L|| and the trace 
class norm by ||£|| Traco • If L is a rank n self adjoint operator then it will have real 
eigenvalues {Aj}™ =1 . In that case we have 



=su P {|A !; |}, 



Trace 



E 



Ai| , Trace (L) =J2 X * 



Also, recall that if L is acting on a finite dimensional space then Trace (L) equals 
the sum of the diagonal elements of any matrix which represents L with respect to 
an orthonormal basis. 



Proposition 5 (Coburn [C02J). For x, y € X let P x and P y be the self adjoint 
projections onto the span of k x and k y respectively. With this notation 



S(x,y) = \\P x -P y \\ = -\\P x -P, 



V 1 1 Trace ' 



PROOF. P x and P y are rank one self adjoint projections and hence have trace 
one. Thus the difference, P x — P y , is a rank two self adjoint operator with trace 
zero and so it has two eigenvalues, ±A for some A > 0. Thus \\P X — P y \\ = A, 
\\Px — -PyllTracc = ^A- We will be finished if we show 2S(x,y) 2 = 2A 2 . We compute 



2A 2 = Trace ((P x - P y 

= Trace (P x + P y - P x P y - P y P x ) 
= 2 - Trace (P x P y ) - Trace {P y P x ) 
= 2 - 2 Trace (P x P y ) . 

Going to last line we used the fact that Trace (AB) — Trace (BA) for any A, B. We 
now compute Trace (P x P y ) . Let V be the span of k x and k y . P x P y maps V into 
itself and is identically zero on V 1 - hence we can evaluate the trace by regarding 
P x P y as an operator on V, picking an orthonormal basis for V, and summing the 
diagonal elements of the matrix representation of V with respect to that basis. We 
select the basis k y and j where j is any unit vector in V orthogonal to k y . Noting 
that P y k y — k y and P y j = we compute 



Trace (P x P y ) = 



PxPykyi kyj + (P X Pyj,j^ 

P x k yi k y ) +0 



k y , k x j k x , k y 



k k 



which is what we needed. 



□ 
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Remark 4.1. Because we actually found the eigenvalues of P x and P y we can 
also write 8 in terms of any of the Schatten p-norms, 1 < p < oo; 5(x,y) — 

2-^\\P X -Py\\ Sp . 

A similar type of computation allows us to compute the operator norm of the 
commutator [P a ,P b ] = P a Pb — PbPa- Informally, if a and b are very far apart, 
S(a,b) ~ 1, then each of the two products will be small and hence so will the 
commutator. If the points are very close, 5(a, b) ~ 0, the individual products will 
be of moderate size and almost equal so their difference will be small. 

Proposition 6. \\[P a ,P b }\\ 2 = S(a,b) 2 (l - S(a,b) 2 ) . 

Proof. Note that [P a ,P(,] is a skew adjoint rank two operator of trace 
and hence has eigenvalues ±i\, for some A > 0. Hence ||[P a ,Pt]|| = A. Also, 
[P a , Pb]* [P a , Pb] is a positive rank two operator with eigenvalues A 2 , A 2 so its trace 
is 2A 2 . We now compute 

[Pa, Pb}* [Pa,Pb] = PaPbPbPa ~ PaP b P a Pb ~ P b PaP b Pa + P b PaP a Pb 
= PaPbPa - PaPbPaPb ~ P b PaP b Pa + P b P a P b 

Recalling that for any A, B, Trace (AB) = Trace (BA) and also that the projections 
are idempotent we see that 

Trace (P a P 6 P a ) = Trace (P a P b ) = Trace (P b P a ) = Trace (P 6 P a P 6 ) . 

The two middle quantities were computed in the previous proof 

Trace (P a P b ) = Trace :/',,/•„: = ( /,•„ . /,-,, ; " 
We also have 

Tracc(P a P b P a P b ) = Trace (P b P a P h P a ) 

We compute the trace of the rank two operator PbP a PbP a by summing the diagonal 
entries of the matrix representation of the operator with respect to an orthonormal 
basis consisting of k a and j, a unit vector orthogonal to k a . 



TvS.Ce {PbPaPbPa) = [PbPaP b Paka,ka) + (P b P a PbPa] , j) 

(4.2) = (p b P a P b k a X) + 

Next note that 

PbPaPbh = (k a ,k b ) PbPak 

= (k a ,kb^j (kb,ka^j Pb ka 

= (k a , fc b ) ( fc b , k a ) (k a , kb) kb 
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and hence we can evaluate (|4.2[) and obtain 



k a ^ kl 



Thus 



| [P a , P b ] || 2 = - Trace ([P a , P b ]* [P a , P b ]) 

= - (2 Trace (P a P b ) - 2 Trace {P b P a P b P a )) 





2 


















'-(>- 


(ha-, kb^ 


1 


(^ka-) kb^ 



5(a,b) 2 (1 -5{a,bf 



□ 



Hankel forms: The projection operators of the previous section can be written 
as P a — k a )(k a . In some contexts these are the primordial normalized Toeplitz 
operators. More precisely, on a Bergman space these are the Toeplitz operators 
with symbol given by the measure ||fc a || 2 S a and general Toeplitz operators are 
obtained by integrating fields of these. There is also a map which takes functions 
on X to bilinear forms on H x H. The basic building blocks for that construction 
are the norm one, rank one (small) Hankel forms given by L a = k a ® k a , thus 

L a (f,g) = (g-k^j = f(a)g(a). 

Limits of sums of these, or, equivalently, integrals of fields of these; are the Hankel 
forms on H; for more on this see |JPR] . 

The norm of a bilinear form B on H x H is 



\B\ 



S up{\B(f,g)\:f,geH,\\f\\ 



1}. 



Associated to a bounded B is a bounded conjugate linear map (3 of H to itself 
defined by (f,j3g) — B(f,g). If we then define a conjugate linear (3* by (/?*/, g) = 
(fig,f) then 

Thus /3*/3 is a positive linear operator. The form norm of B equals the operator 

1 /2 

norm of (ft* p) and we define the trace class norm of B to be the trace of the 

1 /2 

positive operator (/3*/3) . With these definitions in hand we have a complete 
analog of Proposition [5j 



Proposition 7. For x.y E X 



1 



8(x,y) = \\L X - L y \\ = - \\L X - L 



V 1 1 Trace ' 



PROOF. Let f3 x and f3 y be the conjugate linear maps associated with L x and 

L y . One computes that (3 x f — (3*f = (k a , f \ k a and similarly for (3 y . Using this 

one then checks that for any x, y; (3y(3 x = P y P x . Thus the proof of Proposition [5] 
goes through. □ 



5. Formal Properties 

We collect some observations on how the metric S interacts with some basic 
constructions on RKHS's. 
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5.1. Direct Sums. If H is a RKHS of functions on a set X and J is a RKHS 
on a disjoint set Y then we can define a RKHS (H, J) on X U Y to be the space of 
pairs (h,j) with h e H,j e J regarded as functions on X U Y via the prescription 



(h,j)(z) = 

One then easily checks that 



h(z) if 2 g X 
j(z) ifzeY. 



S H {z,z') if z,z'eX 
S(h,j)(z,z') = { 6j(z,z') i£z,z'€Y . 

1 otherwise 

That computation is fundamentally tied to the fact that H and J are sets of 
functions on different spaces. If, however, all the spaces considered are functions 
on the same space then it is not clear what the general pattern is. That is, if H is a 
RKHS on X and if J, J' are two closed subspaces of H with, hoping to simplify the 
situation, J _L J' then there seems to be no simple description of the relationship 
between Sh,Sj,Sj>, and 5j®j>. In some of the examples in a later section we will 
compute these quantities with J' = J x but no general statements are apparent. 

5.2. Rescaling. Suppose H is a RKHS of functions on X and suppose that 
G(x) is a nonvanishing function on X; G need not be in H and it need not be 
bounded. The associated rescaled space, GH, is the space of functions {Gh : h e H} 
with the inner product 

(Gf,Gg) GH = (f,g) H . 

It is straightforward to check that GH is an RKHS and that its kernel function, 
Kqh is related to that of H, Kh by 



K GH {x lV )=G{x)G{y)K H {x,y). 

An immediate consequence of this is that 8 does not see the change; 5qh = $h- 

Elementary examples of rescaling show that certain types of information are 
not visible to 6. Suppose we rescale a space H to the space cH for a number c 
(that is; |c| 2 (f,g) cH = (/:<?)#)• The natural "identity" map from H to cH which 
takes the function / to the function / will be, depending on the size of c, a strict 
expansion of norms, an isometry, or a strict contraction. However it is not clear 
how one can recognize these distinctions by working with 5h and S c h- 

An awkward fact about rescaling is that sometimes it is present but not ob- 
viously so. Consider the following two pair of examples. First, let H be Hi, the 
Hardy space of the disk. This can be realized as the closure of the polynomials 
with respect to the norm 



For a weight, a smooth positive function w{9) defined on the circle, let T-L\^ w be 
the weighted Hardy space; the space obtained by closing the polynomials using the 
norm 



2 f 27r I 

5>*V„=y E 



a k e ike 



2 <^ d6 
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This is also a RKHS on the disk and in fact these two spaces are rescalings of each 
other. However to see that one needs to use a bit of function theory. The crucial 
fact is that one can write w{6) as 

w{6) = \W(e l6 )\ 2 

with W(z) and 1/W(z) holomorphic in the disk and having continuous extensions 
to the closed disk. The functions W ±x can then be used to construct the rescalings. 

We now do a similar construction for the Bergman space. That space, H2 in 
our earlier notation, is the space of holomorphic functions on the disk normed by 

II/IIL= / / \f^)\ 2 dxdy. 
J Jo 

A weighted version of this space is given by replacing dxdy by w(z)dxdy for some 
smooth positive bounded w. To make the example computationally tractable we 
suppose w is radial; w(z) = v{\z\). We define the weighted Bergman space W.2,w by 
the norming function 

H/llL„= / / \f(z)\ 2 w(z)dxdy. 
J Jo 

The space ^2,™ is a RKHS on the disk and is an equivalent renorming of "H 2 but is 
not related by rescaling. One way to see this is to note that, because the densities 
1 and w(z) are both radial, in both cases the monomials are a complete orthogonal 
set. Thus, in both cases, the kernel function restricted to the diagonal is of the 
form 

00 , ,2n 

K(z, z) = J2 7-^2 = a o + ai \A +■■■■ 

\\ z II 

Hence we can compute that for z near the origin 

5(o,z) = ||m(i + o(M 2 )). 

If the spaces were rescalings of each other then the coefficients || 1|| / ||z|| would have 
to match, but this is not true independently of the choice of w. 

5.3. Products of Kernels. In some cases the kernel function for a RKHS 
has a product structure. We begin by recalling two constructions that lead to that 
situation. Suppose that for i = 1, 2; Hi is a RKHS on Xj. We can regard the Hilbert 
space tensor product Hi ® H2 as a space of functions on the product Xi x X2 by 
identifying the elementary tensor hi ® h 2 with the function on X\ x X 2 whose value 
at (xi, X2) is hi(xi)h,2{x2) ■ It is a standard fact that this identification gives H\®H2 
the structure of a RKHS on X\ x X2 and, denoting the three kernel functions by 
Ki, K2, and we have 

Ki,2 {{xi,x 2 ) , (x[,x' 2 )) = K x (xi,x[)K 2 (x2,x' 2 ) . 

Now suppose further that X\ = X 2 and denote both by X. The mapping of x e X 
to (x, x) e X x X lets us identify X with the diagonal D C X x X and we will use 
this identification to describe a new RKHS, -H12, of functions on X (now identified 
with D). The functions in if 12 are exactly the functions obtained by restricting 
elements of Hi <g) H2 to D. The Hilbert space structure is given as follows. For 
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/, g € -Hi 2 let F and G be the unique functions in H± <S> H2 which restrict to / and 
g and which have minimum norm subject to that condition. We then set 

Ui9)h 12 = ( f ^ G )h 1 ^h 2 ■ 
To say the same thing in different words we map H12 into Hx®H% by mapping each 
h to the unique element of H\ (8 H2 which restricts to h and which is orthogonal to 
all functions which vanish on D. The Hilbert space structure on _ff 12 is defined by 
declaring that map to be a norm isometry. It is a classical fact about RKHSs that 
K12, the kernel function for H12, is given by 

K12 (x, y) = K\{x, y)K 2 (x, y). 

This leads to a relatively simple relation between the distance functions 8\, 82 and 
5\2 which we now compute. Pick x,y € X. We have 



K 12 (x,x)Kx 2 (y,y) 
= iK.jx^)] 2 \K 2 (x,y)\ 2 

K 1 (x,x)K 1 (y,y) K 2 (x,x)K 2 (y,y) 
= (l-Sl (x,y)) (l- 8 2 2 (x,y)). 

Hence 

S 12 = ^81 + 51-8151 
This implies the less precise, but more transparent, estimates 
(5.1) max{<5i, 5 2 } < 812 < 81 + 8 2 ; 

with equality only in degenerate cases. Similar results hold for £1,2, the distance 
associated with Hi (8 H% . 

A particular case of the previous equation is H\= i?2-In that case 



812 = yw? - Si > St. 

This monotonicity, which for instance relates the 8 associated with the Hardy space 
with that of the Bergman space, is a special case of the more general fact. If we 
have, for a set of positive a, a family of spaces H a and if there is a fixed function 
K so that the kernel function K a for H a is K a then there is automatically a 
monotonicity for the distance functions; if a' > a then S a > > 5 a . This applies, for 
instance, to the families {H a } and {Fp} introduced earlier. We also note in passing 
that in those two families of examples there is also a monotonicity of the spaces; 
if a < a' then there is a continuous, in fact a compact, inclusion of 1-L a into H. a '\ 
similarly for the Fp. 

6. The Skwarcynski Metric 

In K] Kobayshi studies the differential geometry of bounded domains, Q, in 
C n . He begins with the observation that there was a natural map of f2 into the 
projective space over the Bergman space of f2. He then notes that either of two 
naturally occurring metrics on that projective space could then be pulled back to 
where they would be useful tools. However looking back at his paper there was 
no particular use made of the Bergman space beyond the fact that it was a RKHS. 
We will now describe his constructions and see that they give expressions which are 
not the same as 5 but are closely related. 



12 



N. ARCOZZI, R. ROCHBERG, E. SAWYER, AND B. D. WICK 



Suppose H is a RKHS of functions on X. Canonically associated with any 
point x £ X is a one dimensional subspace of H, the span of the kernel function k x , 
or, what is the same thing, the orthogonal complement of the space of functions 
in H which vanish at x. The projective space over H, P(H), is the space of one 
dimensional subspaces of H. Hence for each x £ X we can use the span of k X: \k x \ 
to associate to x a point p x — [k x ] £ P(H). To understand the geometry of this 
mapping we break in into two steps. First, we associate to each x £ X the set 
of vectors in the unit sphere, S(H) that are in the span of k x ; all these vectors 
of the form e k x for real 9. The next step is to collapse this circle sitting in the 
unit spheres, \e w k x : 9 £ M} , to the single point p x = [k x ] . In fact every point 

p £ P{H) is associated in this way to a circle C(p) C S(H) and distinct points 
correspond to disjoint circles. We now use the fact the distance function of H 
makes S(H) a metric space and use that metric to put the quotient metric on 
P{H). That is, define a metric 8 on P(H), sometimes called the Cayley metric, by: 

S(p, q) = inf {||r - s\\ : r £ C{p),s £ C{q)} 

On the subset {p x : x £ X} we have explicit descriptions of the circles C(p x ) 
and we compute 



S{Vx,Py) = inf | 



inf A /2-2Ree i ( e -'J) (k x ,k 



V2J1 



k k 

™x 1 



We now pull this metric back to X and, with slight abuse of notation, continue to 
call it 5\ 



(6.1) 



6(x, y) = V2^Jl - (k x ,k y 



Thus the map of X into P(H) which sends x to p x is a isometry X with the metric 
5 into P{H) with its metric as a quotient of S(H). 

This metric was studied further by Skwarcyhski in S and then in collaboration 
with Mazur and Pflug in [ MPS] . In |JPj it is referred to as the Skwarcyhski metric. 

The second metric Kobayshi introduces in this context is again obtained by 
putting a natural metric on P(H) and then, again, pulling it back to X. The Fubini- 
Study metric is a natural Kahler metric in finite dimensional projective space and 
Kobayshi extends that definition to the generally infinite dimensional P(H). We 
denote by S the metric obtained by restricting the Fubini-Study metric to the image 
of X in P{H) and then pulling the metric back to X. 

In the small these three metrics are almost the same. If one is small so are the 
others and, in fact, setting S(x,y) = Si, 5(x,y) = 82, S(x,y) = S3 we have that 



(6.2) 



max iSA = O(e) 

i=l,2,3 



max {\Si — S 

i,j=l,2,3 



,|} = 0(e J ). 



The comparison between <5i and S2 follows from (|4.1j) . (|6.1j) and Taylor's theorem. 
The comparison of 82 and £3, which is not difficult, is given in [Kl pg. 282]. 



DISTANCE FUNCTIONS FOR REPRODUCING KERNEL HILBERT SPACES 



13 



Comparing (|4.1|) and (|6.1|) also allows one to see that 

lim J^A_ = l 
5{x <y )^i $(x,y)/V2 

Remark 6.1. In mathematical physics, for certain choices of H, the map from X 
into the projective space is related to coherent state quantization. In that context 
some of the quantities we have been working with, or will be below, are given 
physical names/interpretations. For instance | < k x ,k y > | 2 = 1 — 5{x,y) 2 is the 
probability density for transition from the quantum state [k x ] to the state [k y ] . See, 
for instance, [Ol], [02], [AE], and [PV] . 



7. Differential Geometric Metrics 

In this section we describe the relationship between the distance functions we 
introduced 5, the associated length functions and inner metrics, and a Riemannian 
metric built using the kernel functions. Throughout this section we suppose that H 
is a RKHS of holomorphic functions on a domain X in C. We further suppose that H 
is nondegenerate in the sense that Vx, !/£!, 3h, k € H with h(x) ^ 0, k(x) ^ k{x). 
These restrictions are much more than is needed for most of what follows but some 
restrictions are necessary. For instance the results in the next subsection require 
that the kernel function K(x, y) be sufficiently smooth so that one can apply to it 
the second order Taylor's theorem; some of the results in the third subsection are 
specific to one complex variable. 

7.1. Results of Mazur, Pflug, and Skwarcynski. In an earlier section 
we described how, for distance functions on the unit disk, one could pass from 
a distance to the associated inner distance. That discussion was not specific to 
the disk and we now apply those constructions to distances defined on X. That 
is, given a distance function V we define the length of a curve 7 by iv(j) = 
SU P ^(7(^)7(^+1))} arl( i the inner distance induced by V is given by T>*(x, y) = 
inf {£v("f) ■ 7 is a curve from x to y} . Clearly T>* >T> and if the two functions are 
equal we say T> is an inner distance. For example Euclidean distance on the plane is 
an inner distance; the pseudohyperbolic distance in the disk is not an inner distance, 
its induced inner distance is the hyperbolic distance. 

The reproducing kernel for the Bergman space of the disk, H2 = B(D), the 
Bergman kernel is K(x,y) = (1 — xy) . Using it we can construct a Riemannian 
metric on the disk through 

ds 2 = ^—\ogk z {z) \dz\ 2 = -k\ogk z (z) \dz\ 2 
azaz 4 

2 2 

~ — \ dz \ 

l-\z\ 2 

which is a constant multiple of the density we saw earlier when introducing the 
hyperbolic metric on the disk. 

More generally, if X is a bounded domain in C and H is the Bergman space 
of X, H = B(X), and {k z } are the reproducing kernels then the formula ds 2 = 
dd\ogk z (z) \dz\ defines a Riemannian metric on X, the so called Bergman metric. 
There is also an extension of this construction to domains in C n . 
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In MPS] Mazur, Pflug, and Skwarcyriski prove three theorems. Suppose that 
X is a bounded domain the C (they actually work with C"). Let H be the Bergman 
space of X. For a curve 7 in X let £b(j) be its length measured using the Bergman 
metric. For x,y G X denote the Bergman distance between them by 8B(x,y) = 
inf {^3(7) : 7 a smooth curve from x to y} . 

Proposition 8 ([K], jMPSQ . 

(1) For any smooth curve 7 



(2) 
(3) 



4(7) = hii) = 4f(7) = 5M7). 



<r = <r = <5* = s = ~s B . 

2 



Vx, y € X, x^ y, 5(x, y) < 8* (x, y) . 



In particular (up to a constant factor) the Bergman distance is the inner dis- 
tance generated by our metrics and the metric 6 is never an inner metric. 

The results for 6 are proved in |MPS| (with an unfortunate typo in the state- 
ment of Theorem 2 there). We noted that locally the three distances in (|6.2p agree 
to third order. Hence the three metrics generate the same length function and same 
inner distance. Because of this the results for 8 and 5 follow from the ones for 8. 
The equality 6* — 8, i.e. the statement that 8 is an inner distance, follows from the 
discussion in [K] , 

The discussion in [K] [MPSj is given for Bergman spaces, B(fl). However the 
results hold in more generality. Given X and H of the type we are considering, 
with reproducing kernels {k z } , there is a standard associated Riemannian metric 
given by 

(7.1) ds%= (J^logk z (z)\ \dz\ 2 . 

If we define functions in H by requiring that for all / e H fc^ 1 ^ = f'(z) 
then one can compute that 



2 



K\\ 2 - 



dsjj — T 

llfcJI 4 



\dz\ 2 . 



One can then define the Bergman style length of a curve 7, £bs(i), to be the length 
of 7 measured using dsn and can set 

8bs (x, y) = inf {£bs(i) '■ 7 a smooth curve connecting x to y} . 

We have defined 8 = 8h for such an H . We define 8 — Sh using (|6.1[) . We define 
8 = 8h by following Kobayashi's prescription. We have a map of X into the P{H) 
which sends x to p x = [k x ] ■ We use that map to we pull back the Fubini-Study 
metric on P{H) back to X and call the resulting metric 8. 



Proposition 9. 
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(1) For any smooth curve 7 
(2) 



fc(7)=W=<i(7) = ^»(7). 



(3) 

Vx, y € X, x ^ y,6(x, y) < 6* (x, y) . 

(4) 

Vx, y e X, x ^ y,S{x, y) < S* (x, y) . 

Proof. The proof in |MPSj of versions of the first two statements are based on 
the second order Taylor approximations to the kernel functions; hence those proofs 
apply here as does the discussion in [K] which shows that 5 is an inner metric. The 
third statement follows from the proof in [MPSj together with the fact that for 
any a, 6, c £ X we have the strict inequality 5 (a, c) < 5{a, b) + 6(b, c). The proof of 
that in |MPSj does not use the fact that if is a Bergman space, rather it uses the 
fact that S was obtained by pulling the Cayley metric back from projective space, 
which also holds in our context. The fourth statement also follows from the proof 
in [MPSj if we can establish the fact that for any a,b,c £ X we have the strict 
inequality S(a,c) < 5(a,b) + 5(b,c). We will obtain that from Proposition [5j We 
need to rule out the possibility that 

(7-2) \\P a - Pc\\ =\\Pa~ All + HA - Pell ■ 

The operator P a — P c is a rank two self adjoint operator. Hence hence it has a unit 
eigenvector, v, with 

||(P Q -P c )(«)|| = IIP* - Pc|| • 

For both of the two previous equalities to hold we must also have 

\\(P a - P b ) (v)\\ - ||Pa-Pf,||, \\(P b - P c ) (v)\\ = \\P b -P c \\ 

and hence v must also be an eigenvector of P a — P b and also of P b — P c . 

However in our analysis in Proposition [5] we saw that an eigenvector for an 
operator P x — P y must be in \J {k x , k y } , the span of k x and k y . Thus 

v E V {ka, k c } nV {ka, h} nV {h, k c } = {0} , 
a contradiction. □ 

7.2. The Berezin Transform and Lipschitz Estimates. Suppose A is a 
bounded linear map of H to itself. The Berezin transform of A is the scalar function 
defined on X by the formula 

A(x) — (^Ak x , k x ^j ■ 

For example, if P a is the orthogonal projection onto the span of k a then 

Pa{x) = l-5 2 {a,x). 

Also, recalling Proposition |3l we have the following. Suppose m,n G M(H) and 
that M and N are the associated multiplication operators on H. We then have 



MN*(x) = M(x)N{x) = m(x)n{x). 
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Coburn showed that the metric 6 is a natural tool for studying the smoothness 
of Berezin transforms. 

Proposition 10 (Coburn [CQ2] ). If A is a bounded linear operator on H, x, y € X 
then 

(7.3) \A(x)-A(y)\<2\\A\\S(x,y). 

Thus, also, if m G M(H) and M is the associated multiplication operator then 

(7.4) \m{x)-m(y)\<2\\M\\S(x,y). 

Estimate (|7.3p is sharp in the sense that given H, x, and y one can select A so that 
equality holds. 

PROOF. It is standard that if A is bounded and T is trace class then AT 
is trace class and |Trace(AT)| < ||A|| ||r|| Trace . Recall that P x is the orthogonal 
projection onto the span of k x . Direct computation shows r Tra,ce{AP x ) = A(x). 
Thus A{x) - A(y) < 2 ||A|| \\P X - Py\\ Trlicc . The proofs of ([731) and (7U) are then 
completed by taking note of Proposition [5j To see that the result is sharp evaluate 
both sides for the choice A = P x — P y . Details of those computation are in the proof 
of Proposition [5] □ 



Remark 7.1. By analysis of the two by two Pick matrix of M* one sees that (|7.4j) 
is not sharp. 

Suppose 7 : (0, 1) — > X is a continuous curve in X and that / is a function 
defined in a neighborhood of the curve. We define the variation of / along the curve 
to be 

Var(/) = sup jjj \f(U) - f(t i+1 )\ : < h < ■ ■ ■ < t n < 1, n = 1, 2, ... j . 

Corollary 1. With H,X,j,A as above: 

Var(A) <2\\A\\£ S (7) = \\A\\£ B s(7). 

7 

Proof. If we start with a sum estimating the variation of A and apply the 
previous proposition to each summand we obtain the first estimate. The second 
inequality follows from the first and Proposition [9] □ 

These issues are also studied when X has dimension greater than I and there is 
a rich relationship between the properties of Berezin transforms and the differential 
geometry associated with ds% s , [CO] . |CQ2j . [CL) . [EZ) . [EO] . [BO) . 

7.3. Limits Along Curves. For the most commonly considered examples of 
a RKHS any curve in X which leaves every compact subset of X has infinite length 
when measured by any of the length functions we have been considering. However 
this not always the case. For example, if X is the open unit disk and H is defined 
by the kernel function 

K{z,y) = — — 

l-yz 

then straightforward estimates show that along the positive axis 

1 /I 
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hence the curve [0, 1) has finite length. For more discussion of this see jMcj . This 
suggests there may be interesting limiting behavior as one traverses the curve. 
Suppose / is a function defined on 7 except at the endpoints. Straightforward 
analysis then shows that if Var 7 (/) < 00 then / has limiting values along 7 as one 
approaches the endpoints. Thus 

Corollary 2. Given H,X; suppose 7 : [0, 1) — > X and Ibs{i) < 00 then if to is 
any multiplier of H or, more generally, if A is the Berezin transform of any bounded 
operator on H, then these functions have limits along 7; 

31imm(7(i)); 31iml(7(t)). 
t->i t->i 

Furthermore there are choices of to or A for which the limits are not zero. 

This invites speculation that something similar might be true for H, however 
that situation is more complicated. If we rescale the space then we do not change 
5 and thus don't change the class of curves of finite length. However rescaling 
certainly can change the validity of statements that functions in the space have 
limits along certain curves. Thus the possibility of rescaling a space is an obstacle 
to having a result such as the previous corollary for functions in H. McCarthy 
showed in [Mcj that, in some circumstances, this is the only obstacle. We state a 
version of his result but will not include the proof. 

Proposition 11 (McCarthy |Mcj ) . Suppose H is a RKHS of holomorphic func- 
tions on the disk and that 7 : [0, 1) — > D and £bs(i) < 00. There is a holomorphic 
function G such that every function in the rescaled space GH has a limit along 7. 
That is 

3G, VheH, 31imG(7(i))/i(7(*)). 

Furthermore G can be chosen so that for some h € H this limit is not zero. 

McCarthy's work was part of an investigation of an interface between operator 
theory and differential geometry that goes back (at least) to the work of Cowen 
and Douglas |CD) . They showed that one could associate to certain operators a 
domain in the plane and a Hermitian holomorphic line bundle in such a way that 
the domain and curvature of the line bundle formed a complete unitary invariant 
for the operator. That is, two such operators are unitarily equivalent if and only if 
the domains in the plane agree and the curvatures on the line bundles agree. An 
alternative presentation of their approach yields RKHSs of holomorphic functions 
which satisfy certain additional conditions. In this viewpoint the statement about 
unitary equivalence becomes the statement that for certain pairs of RKHSs of holo- 
morphic functions, if their metrics 5 are the same then each space is a holomorphic 
rescaling of the other. The connection between the two viewpoints is that in terms 
of the kernel function of the RKHS, the curvature function at issue is 

d 2 

tC{z) = —^-^log k z (z). 
ozoz 

Thus K. or, equivalently 5, contains a large amount of operator theoretic information. 
However that information is not easy to access; which is why McCarthy's result is 
so nice and why these relations seem worth more study. 
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7.4. RKHS's With Complete Nevanlinna Pick Kernels. There is a class 
of RKHS's which are said to have complete Nevanlinna Pick kernels or complete 
NP kernels. The classical Hardy space is the simplest. The class is easy to define, 
we will do that in a moment, but the definition is not very informative. A great 
deal of work has been done in recent years studying this class of spaces. The book 
[AM] by Agler and McCarthy is a good source of information. In this and the 
next section we will see that for this special class of RKHS's the function 5 has 
additional properties. 

We will say that the RKHS H has a complete NP kernel if there are functions 
defined on X so that 

that is, if the function 1 — 1/K is positive semidefinitc. 

Of the spaces in our earlier list of examples the Hardy spaces, generalized 
Dirichlet spaces, and the Dirichlet space have complete NP kernels. This is clear 
for the Hardy space, for the generalized Dirichlet spaces if follows from using the 
Taylor series for 1/K, and for the Dirichlet space there is some subtlety involved 
in the verification. On the other hand neither the generalized Bergman spaces nor 
the Fock spaces have a complete NP kernel. 

Suppose H is a RKHS of functions on X and x, y £ X, x ^ y. Let G = G XiV be 
the multiplier of H of norm 1 which has G(x) = and subject to those conditions 
maximizes ReG(y). 

Proposition 12. 

ReG x , y (y) < S(x,y). 
If H has a complete NP kernel then 

ReG XtV (y) = 5(x,y). 

and G is given uniquely by 



G x , y (•) = 5 H (x, y'y 1 (l 



ky (xjk x (• 



k x \x)ky\') t 

Proof. [Sa, (5.9) pg 93] □ 

Remark 7.2. The multipliers of H form a commutative Banach algebra Ai. A 
classical metric on the spectrum of such an algebra, the Gleason metric, is given by 

S G (a,/3) = {supRca(M) : M € M, \\M\\ = 1,/3(M) =0}, 

[Gj . [Lj . |BW]. The points of X give rise to elements of the spectrum via x(G) = 
G(x). Thus if H has a complete NP kernel then 5 agrees with the Gleason metric: 

Vx, y e X, 5 G (x, y) = S(x, y). 



7.4.1. Generalized Blaschke Products. If H = Hi, the Hardy space, when we 
compute G we get 
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1-xy 



u - 



(l-\x\ 2 )(l-yz) 
(1 - yx) (1 - xz) 



xy 



jo 



y-x 
z — 



y - 



1 — yx 1 



xz 



1 — xz 

Thus G x . y is a single Blaschke factor which vanishes at x and is normalized to be 
positive at the base point y. 

Suppose now we have a RKHS H which has a complete NP kernel and let us 
suppose for convenience that it is a space of holomorphic functions on B. Suppose 
we are given a set S = {xi}°^ 1 C D and we want to find a function in H and/or 
M(H) whose zero set is exactly S. We could use the functions G just described and 
imitate the construction of a general Blaschke product from the individual Blaschke 
factors. That is, pick xq £ D \ S and consider the product 



(7.5) 



B(0 = B S , X0 (C) = II G ^o (0 



If the product converges then B (£) will be a multiplier of norm at most one and 
its zero set will be exactly S. 

The multiplier norm dominates the supremum so the factors in (17.51) have mod- 
ulus less than one. Hence the product either converges to a holomorphic function 
with zeros only at the points of S or the product diverges to the function which is 
identically zero. The same applies to the function B 2 (C)- We test the convergence 
of that product by evaluation at xq and, recalling that G ZiV (y) — Sh (z,y) , we see 



(7.6) Bl Xo (x a )=Y[6 2 (x l ,z ), 



(7.7) =n i 



l^o ( x ») I 

, j \ II^Koll \\kxi\\ , 



Recalling the conditions for absolute convergence of infinite products, we have 
established the following: 

Proposition 13. The generalized Blaschke product Bs :Xo converges to an element 
of M(H) of norm at most one and with zero set exactly S if and only if the following 
two equivalent conditions hold 

oo 

Y\_S 2 (x l ,x ) > 

i=l 

(7.8) y i fc ^' 2 <oo. 

fellMI IIM 

If the conditions do not hold then Bs tXa is the zero function. 

Remark: If 1 £ H then M(H) C H and the proposition gives conditions 
that insure that there is a function in H with zero set exactly S. 
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Corollary 3. A sufficient condition for the set S to be a zero set for the Hardy 
space, Hi, or of M(Hi) which is known to be the space of bounded analytic func- 
tions in the disk, is that 



(7-9) E ( 



1 - \x, \ 2 ) < 00. 



A sufficient condition for the set S to be a zero set for the Dirichlet space Hq or of 
M(H ) is that 



(7.10) Vlog — —5 <oo 



Proof. These are just the conclusions of the previous proposition applied 
to the Hardy space and the Dirichlet space with the choice of the origin for the 
basepoint. □ 



Remark 7.3. Condition (|7.9p is the Blaschkc condition which is well known to be 
necessary and sufficient for S to be the zero set of a function in Hi or H°°. 

The condition for S to be a zero set for the Dirichlet space was first given 
by Shapiro and Shields [SS] and the argument we gave descends from theirs. It 
is known that this condition is necessary and sufficient if S C (0, 1) but is not 
necessary in general. 

Remark 7.4. In the next subsection we note that any H with a complete NP kernel 
is related to a special space of functions in a complex ball. Using that relationship 
one checks easily that the convergence criteria in the Proposition is a property of 
the set S and is independent of the choice of xq. 

7.4.2. The Drury Arveson Hardy Space and Universal Realization. For n = 
1,2,... we let B™ denote the open unit ball in C™. We allow n — oo and interpret 
IB 00 to be the open unit ball of the one sided sequence space £ 2 (Z + ) . For each n 
we define the n— dimensional Drury Arveson Hardy space, D n to be the RKHS on 
B n with kernel function 

K n (x,y) = r. 

1 - {x, y) 

Thus when n = 1 we have the classical Hardy space. 

For each n the kernel function K is a complete NP kernel. The spaces D n 
are universal in the sense that any other RKHS with a complete NP kernel can be 
realized as a subspace of some D n . If if is a RKHS on X and H has a complete 
NP kernel then there is for some n, possibly infinite, a mapping 7 : X — ¥ B" and a 
nonvanishing function b defined on X so that Kh, the kernel function for the space 
H , is given by 

(7.11) K H (x,y) = b(x)b(y)KMx)Mv)) = , TTT^? ^ 

1 - (7(x),7(j/)) 

There is no claim of smoothness for 7. All this is presented in |AM| . 

The map 7 can be used to pull back the pseudohyperbolic metric from B n to 
produce a metric on X. First we recall the basic facts about the pseudohyperbolic 
metric B™ . Details about the construction of the metric and its properties can be 
found in |DW] ; the discussion there is for finite n but the rudimentary pieces of 
theory we need for infinite n follow easily from the same considerations. 
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The pseudohyperbolic metric p on the unit disk, B , can be described as 
follows. The disk possesses a transitive group of biholomorphic automorphisms, 
G = {4>a} a£ A ■ Given a pair of points z, w G B 1 select a <p a G G so that <j) a (z) = 0. 
The quantity |</i Q (u>)| can be shown to be independent of the choice of <j> a and we 
define p(z,w) = \(f> a (w) \ . 

In this form the construction generalizes to B™. The n— ball has a transitive 
group of biholomorphic automorphisms, Q = {0p}p eB ■ Given a pair of points 
z,w G B" select a 9p € Q so that Op(z) — 0. The quantity 1^ (w) | can be shown 
to be independent of the choice of Op and we define p n (z,w) = \0p(w)\ . The only 
difference, and that is hidden by our notation, is that now | • | denotes the Euclidean 
length of a vector rather than the modulus of a scalar. The function p n can be shown 
to be a metric and to have the expected properties including invariance under Q 
and having an induced inner metric p* that, up to a scalar factor, agrees with the 
distance induced by the Poincare-Bergman metric tensor. Particularly important 
for our purposes is that there is an analog of (|2.3p |DW[ pg 67]. For z,w EM n 

r719 , . (i-H 2 )(i-M 2 ) 2, s 

(7.12) 1 -2 = p n (z, w) . 

\1 - (z,w)\ 

An immediate consequence of the definition of S, the relationship (|7.11[) . and the 
identity (|7.12[) is that the metric S on X is the pull back of p n by 7. Put differently 
7 is an isometric map of (X,6h) into (B" , pmn) . In particular the S metric on the 
Drury-Arveson space is the pseudohyperbolic metric on B™ : 8d„ = Pb™ • 

8. Invariant Subspaces and Their Complements 

Suppose we are given RKHSs on a set X and linear maps between them. We 
would like to use the Ss on X to study the relation between the function spaces and 
to study the linear maps. The goal is broad and vague. Here we just report on a 
few very special cases. 

We will consider a RKHS H of functions on a set X, a closed multiplier invariant 
subspacc J of H ; that is we require that if j € J and m is a multiplier of H then 
mj € J. We will also consider the J- 1 , the orthogonal complement of J. The spaces J 
and J- 1 are RKHSs on X and we will be interested the the relationship between the 
metrics Sh, 8j, and Sj± . Because we are working with a subspace and its orthogonal 
complement there is a simple relation between the kernel functions. Let {k x } be the 
kernel function H, {j x } those for J and {j^} those for J ± . We then have, Vi e X 

(8.1) k x =j x +j£. 

In terms of P, the orthogonal projection of H onto J, and P x = I — P, we have 
3x — Pk x , jx — P^k x . 

8.1. The Hardy Space. We begin with Hi, the Hardy space. In that case 
there is a good description of the invariant subspaces, the computations go smoothly, 
and the resulting formulas are simple. If J is an invariant subspace then there is 
an inner function Gj so that J = QjHi. The kernel functions are, for z,w G D, 
are given by 

. eJ(z)Qj(w) 

3z{w) = . 

1 — zw 
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Thus if ®j{z)Qj{z') + then 

8 j [z,z') = 6-Hi {z,z') ■ 

In the other cases, by our convention, Sj is undefined. 

Taking into account the formula for j z and (|8.1[) we find 



1 — zw 



and hence 



(i-|ej(z)| a ) (i-\&j{w)\ 2 ) ( i_ N 2 )(1 _ H2) 



1 - 6 2 J± (z, w) = ^ — ^ g ^ v 11 . 

i-e, 7 (z)ejH |i-H' 

We can now use (|2 . 3j) on both fractions and continue with 

l-5j± (z,w) = 

Doing the algebra we obtain 



<5ji (z, iu) = 



p 2 (z, w )- P 2 (e J (z),e J H) 



In particular 



<P 



with equality holding if and only if Qj(z) — Qj(w). 

8.2. Triples of Points and the Shape Invariant. When we move away 
from the Hardy space computation becomes complicated. Suppose H is a RKHS 
on X with kernel functions {k x } and associated distance function S. Select distinct 
x,y, z € X. We consider the invariant subspace J of functions which vanish at x, 
the orthogonal complement of the span of k x . We will denote the kernel functions 
for J by {j z } . We want to compute 5j(y, z) in terms of other data. 

For any £ g X, j z equals minus the projection onto J 1 - of fcf . The space J 1 - 
is one dimensional and spanned by k x hence we can compute explicitly 



h 



ky(x) 



and there is a similar formula for j z . We will need | (j y ,j z )\ z 

2 



I (jv>3z)\ = 



1.2) 



\ky{ Z )Y 



\k y {x)k x {z)Y 



- 2Rek y (z) 



ky(x) 



k x (z). 



\k x \r " % 'ii**ir 

Recall that for any two distinct elements a, /3 of the set {x, y, z} we have 
(8.3) \k a (J3)\ 2 = \k (a)\ 2 = ||fc Q || 2 H^ll 2 (1 - 5 2 (a,P)) . 
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We use this in (|8.2|) to replace the quantities such as the ones on the left in (|8.3p 
with the one on the right. Also, we will write S^g rather than 5 2 (a,f3). For the 
same a, /3 we define Q a j3 and <fr a p with < a p < 7r, and (f> a /3 with < (f> a p < n by 

k a W) = (k a ,k p ) = \\k a \\ \\kf}\\ (cos^)e^ 

and we set 

T = cos 9 xy cos 9 yz cos 9 ZX cos {4> xy + y2 + ZiE ) . 
We now continue from (|8.2p with 

I (j y ,j z )f = IIM 2 IIU* (1 - 5%) + ||M 2 IIM* (1 - %y) (1 ~ O - 2 ||fc,|| 2 ||fc 2 || 2 T 



zy) 

= \\k£ \\k y f { (1 - 8%) + (1 - 5%) (1 - - 2T| 
Similar calculations give 



IliJ 2 = IIM 2 - = IIM 2 - IIM 2 (i - O = IIM 2 ^ 



\\U 2 = \\k z fS 2 xz 



Hence 



2 



I (jy,3z)\ 
\\jy\\ 2 \\jz\\ 2 

\\k z \\ 2 \\k y \\ 2 {(l - 5%) + (1 - gj (1 - - 2T} 
IIMX HMI^L 



1 

'"2/11 u xy 

gyga - (1 - gj - (1 - gj (1 - gg) + 2T 

g^+jL+j|^-2 + 2T 
<S 2 s 2 

^xy^xz 



Thus we can write the very symmetric formula 
(8.4) 5(y,z)Sj(y,z) = 



S 2 xy + S 2 xz +S 2 zy -2 + 2T 



8xy8xz8yz 

One reason for carrying this computation through is to note the appearance of 
T. This quantity, which is determined by the ordered triple {k x , k y , k z } and which is 
invariant under cyclic permutation of the three, is a classical invariant of projective 
and hyperbolic geometry. It is called the shape invariant. The triple determines 
an ordered set of three points in P{H) the projective space over H; p x = [k x ] , p y , 
and p z . Modulo some minor technicalities which we omit, one can regard the three 
as vertices of an oriented triangle T xyz . The edges of the triangle are the geodesies 
connecting the vertices and the surface is formed by the collection of all geodesies 
connecting points on the edges. In Euclidian space two triangles are congruent, one 
can be moved to the other by an action of the natural isometry group, if and only 
if the set of side lengths agree. That is not true in projective space. The correct 
statement there is that two triangles are congruent if and only if the three side 
lengths and the shape invariants match |BTj . jBRj . Using the natural geometric 
structure of complex projective space one can also define and compute the area of 
T xyz . It turns out, roughly, that once the side lengths are fixed then T determines 
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the area of T xyz and vice versa [HM]. Further discussion of the shape invariant is 
in [G3] and p5] . 

The reason for mentioning all this is that T was the one new term that appeared 
in (|8.4p and it is slightly complicated. The fact that this quantity has a life of its own 
in geometry suggests that perhaps the computations we are doing are somewhat 
natural and may lead somewhere interesting. 

In B0e's work on interpolating sequences in RKHS with complete NP kernel [B] 
(see also |Saj ) he makes computations similar in spirit and detail to the ones above. 
Informally, he is extracting analytic information from a geometric hypothesis. It is 
plausible that knowing how such techniques could be extended from three points 
to n points would allow substantial extension of B0e's results. 

8.3. Monotonicity Properties. We saw that when J was an invariant sub- 
space of the Hardy space Hi then for all x,y in the disk 

$j(x, y) = 5 Ul (x, y) > Sj± (x, y). 

It is not clear what, if any, general pattern or patterns this is an instance of. Here 
we give some observations and computations related to that question. 

8.3.1. Maximal Multipliers. Fix H and X. Recall Proposition [T2l given 

X, x ^ y we denoted by G x-y be the multiplier of H of norm 1 which has G(x) = 
and subject to those conditions maximizes ReG(y). The Proposition stated that 
ReG Xj y (y) < SH{x,y) and that equality sometimes held. If equality does hold we 
will say that x, y have an maximal multiplier and we will call G x ^ y the maximal 
multiplier. 

Proposition 14. If x, y € X have a maximal multiplier and J is any closed mul- 
tiplier invariant subspace of H then Sj(x,y) > 5ii(x,y). 

PROOF. Because J is a closed multiplier invariant subspace of H, the maximal 
multiplier G x>v is also a multiplier of J and has a norm, as a multiplier on J, at most 
one. Thus G XtV is a competitor in the extremal problem associated with applying 
Proposition [12] to J. Hence, by that proposition we have ReG^ (y) < Sj(x,y). On 
the other hand our hypothesis is that ReG x .j, (y) = 5h(x, y). □ 

If H has a complete NP kernel then every pair of points, x, y, has a maximal 
multiplier. Hence 

Corollary 4. If H has a complete NP kernel and J is any closed multiplier invariant 
subspace of H then for all x, y G X, 8j(x, y) > 5h{x, y)- 

The converse of the corollary is not true. Having a complete NP kernel is not a 
necessary condition in order for every pair of points to have an maximal multiplier; 
it is sufficient that the kernel have the scalar two point Pick property, see [AM, 
Ch. 6,7]. 

8.3.2. Spaces with Complete Nevanlinna Pick Kernels. Suppose that H is a 
RKHS on X with a complete NP kernel K(-, •). Suppose also, and this is for con- 
venience, that we have a distinguished point ui € X such that \/x € X, K(lo,x) = 
K(x,ui) — 1. The following information about invariant subspaces of H is due to 
McCullough and Trent [MT , further information is in |GRS] , 
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Proposition 15. Suppose J is a closed multiplier invariant subspace of H. There 
are multipliers {mi} so that the reproducing kernel for J is 

(8.5) Kj(x, y) = m t {x)m, l (y)^j K(x, y). 

Corollary 5. If H has a complete NP kernel and J is any closed multiplier invariant 
subspace of H then for all x, y G X 

Sj(x,y) > S H (x,y) > Sj±(x,y). 

Proof. We start with formula (|8.5|) which we rewrite for convenience as 

(8.6) Kj(x,y) = A(x,y)K(x,y). 

The first inequality is the statement of the previous corollary. Alternatively we 
could start from the previous equality and use (|5 . 1 [) to compare Sh and Sj yielding 
a quantitative version of the desired inequality. 

For the second inequality first note that Kj± = K—Kj = K—AK = (1 — A) K. 

(Note that for any x, A(x,x) < 1 because it is the ratio of the squared norms of 
two kernel functions and the one on top, being a projection of the one on bottom, 
has smaller norm. Also, by Cauchy-Schwarz, |^4(ir, y)| 2 < A(x,x)A(y,y) < 1. To 
rule out the case of equality note that if A(x, x) = 1 then k x £ J and hence every 
function in J- 1 vanishes at x which puts x outside the domain of 6j± . ) 

Recalling the formula for 5 we see that our claim will be established if we can 
show for the x, y £ X that are covered by the claim we have 

\l~A(x,y)\ 2 
(l-A(x,x))(l-A(y,y)) ~ 

We have 

|l->l( a: ,y)| 2 >(l-|A(x,y)|) a 

> (l-A(x,x) 1 ' 2 A(y,y) 1 '*) 2 

= 1 - 2A(x, x) x / 2 A(y, y) 1 ' 2 + A(x, x)A(y, y) 

> 1 - A(x, x) - A(y, y) + A(x, x)A(y, y) 
= (1-A(x,x))(l-A(y,y)). 

Here the passage from the first line to the second uses Cauchy-Schwarz, the passage 
from third to fourth uses the arithmetic mean, geometric mean inequality. □ 

8.3.3. Inequalities in the Other Direction; Bergman Type Spaces. Let H — H-2 
be the Bergman space. That is, H is the RKHS of holomorphic functions on the 
disk with reproducing kernel K (z, w) = (1 — wz)~ 2 . Let J be the invariant subspace 
consisting of all functions in H which vanish at the origin. 

Proposition 16. For all z,w G B 



h{z,w) < S H (z,w). 
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Proof. We have 

Kj(z,w) = K(z,w) 



(1 — wz) 2 
_ wz(2~wz) _ 2wz(l - ) 

(1 — wz) 2 (1 — wz) 2 
= B(z.w)K(z, w). 

At this stage we can see the difference between this situation and the one in the 
previous section. Here the ratio Kj / K is not a positive definite function. To finish 
we need to show 

\B{z,w)\ 2 >L 



B(z, z)B(w, w) 



Thus we need to show 



Equivalently, we need 



wz\ 2 > (2 — ww)(2 — zz) 



-4Rewz > —2ww — 2zz. 
This follows from the inequality between the arithmetic and geometric means. □ 

In fact this example is just the simplest case of a general pattern introduced 
in |HJSj and |MR] , In [MR] McCullough and Richter introduce a general class 
of RKHS which share many of the properties of the Bergman space. In particular 
their work covers the spaces T-L a , 1 < a < 2, and we will focus on that case. Suppose 
V is in invariant subspace of some T-L a , 1 < a < 2 and that V has index 1, that is 
dimF zV = 1. Let {k z } be the reproducing kernels for H a and {j z } be those for 
V. 

Proposition 17 (Corollary 0.8 of MR ). There is a function G 6 T~L a and a 
positive semidefinitc sesquianalytic function A(z,w) so that for z,w G D 



j z (w) = G{z)G(w)(l - zwA(z,w))k z (w). 

Corollary 6. 

5y(z,w) < 5-u a (z,w). 

Proof. The factors of G do not affect <5. After they are dropped the argument 
is then the same as in the proof of Corollary [S] 

|1 - zwA{z,w)\ 2 > (1 - \zwA(z,w)\f 

> (1 - \z\ A(z, z) 1/2 |w| A(w, w) 1/2 ) 2 

> 1 - 2 \z\ A(z, z) 1 ' 2 \w\ A(w, w) 1/2 + \z\ 2 A(z, zf' 2 |w| 2 A(w, w) 

> 1 - \z\ 2 A(z,z) - \w\ 2 A(w,w) + \z\ 2 A(z,z) \w\ 2 A(w, w) 

> (l~\z\ 2 A(z,z))(l-\w\ 2 A(w,w)) 

which is what is needed. 

Again, the passage from the first line to the second uses Cauchy-Schwarz, the 
passage from third to fourth uses the arithmetic mean, geometric mean inequality. 

□ 
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It is not clear in this context what happens with spaces of the form J . The 
following computational example suggests the story may be complicated. Let J be 
the invariant subspace of H = H2 consisting of functions / with / (0) = /' (0) = 0. 
The reproducing kernel for is 

Kj± {z, w) = 1 + 2wz. 

To compare 6j± with Su we compare the quantities 1 — 8 2 . If we are looking for an 
example where the results for spaces with complete NP kernels fails then we need 
to find z, w such that 

\i + mzf < (i-M 2 ) 2 (i-N 2 ) 2 

(l + 2H 2 ) (l + 2|z| 2 ) " \l-wzf 

This certainly does not hold if either \z\ or \w\ is close to 1 however things are 
different near the origin. Suppose z = — w = t > 0. The inequality we want is 

(1-2^ (1-t 2 ) 4 
(l + 2t 2 ) 2 " (1 + i 2 ) 4 ' 

This fails if t is close to one but we study it now for t near 0. In that case the left hand 
side is 1 - 8i 2 + 32i 4 - 96i 6 + O (t 8 ) and the other side is 1 - 8t 2 + 32< 4 - 88t 6 + O (t s ) . 
Hence for small t the inequality docs hold. 

Proposition 18. With H and J as described, if t is small and positive then 

6j±(t,-t) > 5 H (t,-t). 
For t near 1 the inequality is reversed. 



9. Questions 

We mentioned in the introduction that most questions in this area have not 
been studied. Here we mention a few specific questions which had our interest while 
preparing this paper and which indicate how little is know. 

1. Suppose H and H' are two RKHSs on the same X with distance functions S 
and 5'; and suppose further, in fact, that H and H' are the same spaces of functions 
with equivalent norms. What conclusions follow about 5 and 8'7 

2. In the other direction, what conclusion can one draw about the relation 
between H and H' if the identity map from (X, 6) to (X,5') is, say, a contraction 
or is bilipschitz? 

3. It seams plausible that there is a more complete story to be told related 
to Corollary [5l What is the full class of RKHS for which those conclusions hold? 
What assumptions beyond those conclusions are needed to insure that the space 
being considered has a complete NP kernel? 

4. Given X what metrics 5 can arise from a RKHS H on X. The question 
is extremely broad but notice that if you assume further that H must have a 
complete NP kernel then, by virtue of the realization theorem, a necessary and 
sufficient condition is that for some n there is an isometric map of (X, S) into 
(M n ,p) . Although that answer is perhaps not particularly intuitive it does give a 
condition that is purely geometric, none of the Hilbert space discussion survives. 
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